Kodak consumer video benchmark data set : concept definition and annotation
نویسندگان
چکیده
Semantic indexing of images and videos in the consumer domain has become a very important issue for both research and actual application. In this work we developed Kodak’s consumer video benchmark data set, which includes (1) a significant number of videos from actual users, (2) a rich lexicon that accommodates consumers’ needs, and (3) the annotation of a subset of concepts over the entire video data set. To the best of our knowledge, this is the first systematic work in the consumer domain aimed at the definition of a large lexicon, construction of a large benchmark data set, and annotation of videos in a rigorous fashion. Such effort will have significant impact by providing a sound foundation for developing and evaluating large-scale learningbased semantic indexing/annotation techniques in the consumer domain. This report includes information about the concept definitions, the annotation process, video collection process, and the data structures used in the release file. The released dataset includes the annotations, extracted visual features (for videos from Kodak), and URLs of videos from YouTube. The Appendix section also includes the full list of concepts (more than 100 concepts in 7 categories) that have been defined in the consumer video domain.
منابع مشابه
Building a Large Annotation Ontology for Movie Video Retrieval
Multimedia content continues to grow rapidly. To ensure access to growing video collections, semantic indexing of images and videos has become a very important issue for data access, retrieval and actual application. For developing and evaluating semantic concepts searching annotation techniques, it is necessary to predefine a large lexicon, construction of a large benchmark data set, and annot...
متن کاملVIREO-374: LSCOM Semantic Concept Detectors Using Local Keypoint Features
Semantic concept detection aims to rank video shots in large scale video corpus according to the presence of a specific concept, such as ``sports'', ``charts'', ``people marching'', and etc. In recently years, mainly motivated by the NIST TRECVID [1] which provides common video data and benchmark evaluation, a number of successful concept detection systems have been developed. As manually annot...
متن کاملRobust Semantic Video Indexing by Harvesting Web Images
Semantic video indexing, also known as video annotation, video concept detection in literatures, has attracted significant attentions recently. Due to the scarcity of training videos, most existing approaches can scarcely achieve satisfactory performances. This paper proposes a robust semantic video indexing framework, which exploits user-tagged web images to assist learning robust semantic vid...
متن کاملOnline Multi-Label Active Learning for Large-Scale Multimedia Annotation
Existing video search engines have not taken the advantages of video content analysis and semantic understanding. Video search in academia uses semantic annotation to approach content-based indexing. We argue this is a promising direction to enable real content-based video search. However, due to the complexity of both video data and semantic concepts, existing techniques on automatic video ann...
متن کاملImageCLEF 2014: Overview and Analysis of the Results
This paper presents an overview of the ImageCLEF 2014 evaluation lab. Since its first edition in 2003, ImageCLEF has become one of the key initiatives promoting the benchmark evaluation of algorithms for the annotation and retrieval of images in various domains, such as public and personal images, to data acquired by mobile robot platforms and medical archives. Over the years, by providing new ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008